CDS

Accession Number TCMCG075C13717
gbkey CDS
Protein Id XP_007034160.2
Location complement(join(25533368..25533481,25533647..25533767,25534327..25534473,25534639..25534744,25534865..25534947,25535029..25535156,25535250..25535336,25535424..25535534,25535811..25535978,25536086..25536170,25536254..25536370,25536456..25536532,25536654..25536725,25537884..25537980,25538063..25538265,25538351..25538453,25538548..25538615,25538727..25538780,25538876..25538985,25540060..25540138,25540230..25540292,25541688..25541807,25542512..25542568,25542661..25542777,25544386..25544436,25544520..25544618,25544920..25545065,25547237..25547447))
Gene LOC18602604
GeneID 18602604
Organism Theobroma cacao

Protein

Length 997aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007034098.2
Definition PREDICTED: probable UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SEC [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category GOT
Description UDP-N-acetylglucosamine--peptide N-acetylglucosaminyltransferase SEC
KEGG_TC -
KEGG_Module -
KEGG_Reaction R09304        [VIEW IN KEGG]
R09676        [VIEW IN KEGG]
KEGG_rclass RC00005        [VIEW IN KEGG]
RC00059        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
ko03036        [VIEW IN KEGG]
KEGG_ko ko:K09667        [VIEW IN KEGG]
EC 2.4.1.255        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00514        [VIEW IN KEGG]
ko04931        [VIEW IN KEGG]
map00514        [VIEW IN KEGG]
map04931        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGCTCTCCTTGCAGAGCGATCCTCGGCTGCAACAGTACCATCATAGCCAGCAGCTTCAACAACAACTACAGCAGCAACAGGTTCAATTGGTTCCATACAACGATGACTCACTGAGTCTGCACTCTGATTTTGGAGGCGCCGTTGCTGCTGCTTCTTCTTCTTCTGCTTTGGTTAATCTCAAGCCCTCTCAGGGTTTGGACTCCCATGAAGTTGATGATGACACACTCATGGCCCTTGCTCATCAAAAGTACAAGGCTGGGAACTACAAGCATGCTTTAGAACACAGCAATGCAGTCTATGAGAGGAACCCACATCGTACTGATAATTTGCTTCTCCTGGGTGCAATTCATTATCAGTTGCATAATTATGATCAATGCATTGCAAAGAATGAAGAAGCTCTTAGAATTGATCCACAATTTGCTGAGTGCTATGGAAATATGGCAAATGCTTGGAAGGAGAAAGGCAATATTGATGCTGCAATCCGGTATTATTTGTTTGCTATCGAGCTTCGACCGAATTTTGCTGATGCATGGTCAAATCTAGCTAGTGCATACATGCGGAAAGGGAGGCTTAATGAGGCAGCCCAGTGTTGTCGCCAGGCGCTTGCATTAAATCCCCGTTTGGTTGATGCTCATAGTAACCTTGGCAATTTAATGAAAATTCAAGGTTTTGTGCAAGAGGCTTACAGTTGCTACCTTGAGGCTCTTCGTATACAACCCAATTTCGCAATTGCATGGTCTAATCTTGCTGGGCTTTTCATGGAAGCTGGGGATCTTAACAGGGCACTTCAATACTATAAGGAAGCAGTGAGACTTAAACCAACATTTTTTGATGCCTACCTAAACCTTGGAAATGTTTATAAGGCTCTGGGAATGCCTCAAGAGGCTATTGTATGCTATCAGCGTGCTCTTCAGGTCCGACCGGATTATGCTATGGCCTATGGCAATTTGGCAAGTATTTATTACGAACAGCGTAACCTGGATATGGCAATTCTCAATTATAGGAGAGCAATTGCTCTTGACTCAGGATTTTTGGAGGCATATAACAATTTGGGTAATGCTTTGAAAGATGCTGGAAGAGTTGATGAAGCAACACAATGCTATAGGCAATGTCTTGCCTTGCAACCTAACCATCCTCAGGCACTTACAAACCTTGGGAATATATATATGGAATGGAATATGTTGACTGCTGCTGCTTCATGCTACAAGGCAACTTTATCTGTGACAACAGGACTTTCTGCTCCTTTCAACAATTTAGCAATCATTTACAAACAGCAGGGCAATCTCTCAGATGCTATATCTTGTTACAATGAAGTTCTGCGTATTGATCCTATGGCCGCTGATGCACTTGTCAACCGGGGGAACACATATAAGGAGAGTGGAAGAGTAAATGAAGCCATTCAAGATTACATACGAGCTATTAACATTAGGCCAGCCATGGCTGAAGCTCATGCAAATTTGGCTTCGGCTTATAAGGACAGTGGACATGTTGAGGCTGCAATAAAAAGCTACAAGCAAGCACTGGCTCTTCGCCCTGATTTTCCAGAAGCAACCTGTAACCTTCTACATACATTACAGTGTGTTTGTGACTGGGAGGATCGAGAGAATAAATTTATTGAGGTTGAAGGCATACTCAGGAGACAGATTAAGATGTCTGTTATTCCTAGCGTGCAGCCTTTCCATGCAATAGCCTATCCAATTGATCCAGTGCTTGCACTAGATATCAGTCGTAAATATGCGGCACACTGCTCTGTTATTGCATCTCGTTATTCACTTGCTCGTTTCAACTATCCTGCACCCTTCCCTGTGAAGAGTGAGAATGGGAATGGACGCTTAAGGGTGGGATATGTGAGTAGTGATTTTGGCAACCATCCCCTATCTCATCTCATGGGCTCAGTCTTTGGCATGCACAATAGAGAAAATGTTGAGGTATTCTGCTATGCATTGAGTCCAAATGATGGAACAGAATGGAGGTTGCGTATCCAGTCTGAAGCAGAGCACTTCATAGATGTATCATCCATGTCCTCTGACATCATTGCAAAGATGATAAATGAGGATAAAATACAAATTCTTGTCAATCTTAATGGCTATACGAAGGGGGCAAGGAATGAGATATTTGCTATGCAACCTGCTCCTATTCAGATTTCTTACATGGGATTTCCTGGGACTACTGGTGCATCATATATACACTATTTGGTCACTGATGAGTTCGTCTCACCTCTTCGTTTTTCTCATATCTACTCTGAGAAGCTTGTTCACCTTCCTCATTGTTACTTTGTAAATGATTATAAGCAGAAAAATCGTGATGTCTTGGATCCCAAGTGCTTGCCTAAGAGATCTGATTATGGATTACCAGAAGACAAATTTATCTTTGCATGTTTCAATCAGCTGTACAAGATGGATCCTGACATTTTCACCACATGGTGCAATATTCTTAAGCGTGTTCCCGATAGTGCTCTTTGGCTTCTTAGATTCCCAGCTGCAGGCGAGATGAGACTTCGCACATATGCAACTCAGCAGGGTGTGCGGCCGGATCAGATTATATTTACAGATGTTGCCTTGAAAAGTGAACATATAAGACGCAGTGCCTTGGCAGATCTCTTCCTTGATACACCATTATGCAATGCGCATACAACAGGCACTGATGTTTTATGGGCTGGTCTTCCAATGGTGACCCTTCCACTTGACAAGATGGCGACTAGAGTTGCTGGTTCCTTGTGTTTGGCTACTGGTGTCGGGGAGGAGATGATTGTCAGCTGTTTGAAAGAATACGAAGAGAAGGCTGTCTCACTTGCTCTAAATCGTCCAAAGCTCCAGGATCTTTCTAATAAACTCAAAGAAGCCCGTATGACTTGCCCTCTTTTTGACACATTACGCTGGGTGAGGAACCTTGAACGAGCATATTTTAAGATGTGGAATCTATGCTGCTTAGGTCATCAACCACAACCCTTTAAAGTGACGGAGAGTGATCAAGAATTTCCTTATGATAGATAG
Protein:  
MLSLQSDPRLQQYHHSQQLQQQLQQQQVQLVPYNDDSLSLHSDFGGAVAAASSSSALVNLKPSQGLDSHEVDDDTLMALAHQKYKAGNYKHALEHSNAVYERNPHRTDNLLLLGAIHYQLHNYDQCIAKNEEALRIDPQFAECYGNMANAWKEKGNIDAAIRYYLFAIELRPNFADAWSNLASAYMRKGRLNEAAQCCRQALALNPRLVDAHSNLGNLMKIQGFVQEAYSCYLEALRIQPNFAIAWSNLAGLFMEAGDLNRALQYYKEAVRLKPTFFDAYLNLGNVYKALGMPQEAIVCYQRALQVRPDYAMAYGNLASIYYEQRNLDMAILNYRRAIALDSGFLEAYNNLGNALKDAGRVDEATQCYRQCLALQPNHPQALTNLGNIYMEWNMLTAAASCYKATLSVTTGLSAPFNNLAIIYKQQGNLSDAISCYNEVLRIDPMAADALVNRGNTYKESGRVNEAIQDYIRAINIRPAMAEAHANLASAYKDSGHVEAAIKSYKQALALRPDFPEATCNLLHTLQCVCDWEDRENKFIEVEGILRRQIKMSVIPSVQPFHAIAYPIDPVLALDISRKYAAHCSVIASRYSLARFNYPAPFPVKSENGNGRLRVGYVSSDFGNHPLSHLMGSVFGMHNRENVEVFCYALSPNDGTEWRLRIQSEAEHFIDVSSMSSDIIAKMINEDKIQILVNLNGYTKGARNEIFAMQPAPIQISYMGFPGTTGASYIHYLVTDEFVSPLRFSHIYSEKLVHLPHCYFVNDYKQKNRDVLDPKCLPKRSDYGLPEDKFIFACFNQLYKMDPDIFTTWCNILKRVPDSALWLLRFPAAGEMRLRTYATQQGVRPDQIIFTDVALKSEHIRRSALADLFLDTPLCNAHTTGTDVLWAGLPMVTLPLDKMATRVAGSLCLATGVGEEMIVSCLKEYEEKAVSLALNRPKLQDLSNKLKEARMTCPLFDTLRWVRNLERAYFKMWNLCCLGHQPQPFKVTESDQEFPYDR